Context-Sensitive Statistics For Improved Grammatical Language Models

نویسندگان

  • Eugene Charniak
  • Glenn Carroll
چکیده

We develop a language model using probabilistic context-free grammars (PCFGs) that is “pseudo context-sensitive” in that the probability that a nonterminal N expands using a rule T depends on N’s parent. We give the equations for estimating the necessary probabilities using a variant of the inside-outside algorithm. We give experimental results showing that, beginning with a high-performance PCFG, one can develop a pseudo PCSG that yields significant performance gains. Analysis shows that the benefits from the context-sensitive statistics are localized, suggesting that we can use them to extend the original PCFG. Experimental results confirm that this is both feasible and the resulting grammar retains the performance gains. This implies that our scheme may be useful as a novel method for PCFG induction.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Statistical Language Modeling Using Grammatical Information

We propose to investigate the use of grammatical information to build improved statistical language models. Until recently, language models were primarily innuenced by local lexical constraints. Today, language models often utilize longer range lexical information to aid in their predictions. All of these language models ignore grammatical considerations other than those induced by the statisti...

متن کامل

On Vertical Grammatical Restrictions that Produce an Infinite Language Hierarchy

This paper introduces deriuation table.s that represent a complete grammatical derivations as whole in a vertical way. These tables are obtained by writing the consecutive sentential forms of grammatical derivations vertically one by one. The present paper places and discusses some restrictions on the columns of these tables. IVIore specifically, these restrictions constrain the order of contex...

متن کامل

Context-dependent factored language models

The incorporation of grammatical information into speech recognition systems is often used to increase performance in morphologically rich languages. However, this introduces demands for sufficiently large training corpora and proper methods of using the additional information. In this paper, we present a method for building factored language models that use data obtained by morphosyntactic tag...

متن کامل

The Dual Meaning Potential of Prepositional Grammatical Metaphor in Prose Fiction

From a Systemic Functional perspective, Grammatical Metaphor (GM) as is taken to be a chief driving force in the discourse of different genres, an important adult language machinery for ideational meanings to be semantically cross-mapped and realized through a different form in the stratum of the lexico-grammar, in order to convey changed meanings and tinker with the discursive flow and develop...

متن کامل

Grammar-based context-specific statistical language modelling

This paper shows how we can combine the art of grammar writing with the power of statistics by bootstrapping statistical language models (SLMs) for Dialogue Systems from grammars written using the Grammatical Framework (GF) (Ranta, 2004). Furthermore, to take into account that the probability of a user’s dialogue moves is not static during a dialogue we show how the same methodology can be used...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1994